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Abstract 


We  present  an  observer  design  for  systems  with  controlled  nonlinear  dynam¬ 
ics  and  nonlinear  observation.  The  design  is  a  development  of  earlier  work, 
which  was  motivated  by  nonlinear  filtering  asymptotics.  The  basic  design  re¬ 
quires  that  the  initial  conditions  belong  to  a  bounded  region  determined  by  the 
data  and  design  parameters.  However,  for  a  certain  class  of  systems,  no  such 
a  priori  knowledge  is  required.  To  illustrate  the  utility  of  our  design,  several 
examples  are  given. 

Key  words:  Nonlinear  systems,  observers. 


1  Introduction 


In  this  paper  we  present  a  design  for  an  observer  for  the  nonlinear  control  system 

x  =  f(x,u),  x(0)  =  x0,  (1) 

y  =  h(x) 

where  x  G  Mn,  u  G  lRm,  |u,|  <1  i  =  and  y  G  TRP .  The  initial 

condition  Xo  is  unknown. 

The  observer  problem  consists  of  recursively  computing  an  estimate  z(t)  of 
x(t)  for  which  the  error  decays  to  zero  els  t  — ■>  oo.  That  is,  to  design  a  system 

F(m,u,y),  m(0)  —  m0,  (2) 

G(m ) 


m  = 


z  = 


such  that 


lim 

<—►00 


|z(t)  -  z(t)  |  =  0  (3) 

for  all  x0  in  a  suitable  class  I .  Here  I  represents  a  priori  knowledge  concerning 
the  initial  condition  Xo- 

We  prove  the  following  result  for  our  observer  design:  provided  that  we  have 
some  knowledge  of  x0  (in  the  form  \xQ  —  zQ\  <  p,  where  z0  is  the  initial  estimate) 
and  assuming  that  (l)  satisfies  a  detectability  condition,  then  the  observer  es¬ 
timate  z(t)  converges  exponentially  to  the  system  trajectory  x(t)  as  t  — *•  oo 
(Theorem  2.1).  The  radius  of  convergence  p  depends  on  the  nonlinearities  in 
the  dynamics  and  observations  as  well  as  on  certain  design  parameters.  For  a 
certain  class  of  systems,  no  knowledge  of  Xq  is  required  (Corollary  2.1). 

Our  design  is  a  development  of  the  design  given  in  Baras,  Bensoussan  and 
James  [l],  which  treats  systems  with  uncontrolled  nonlinear  dynamics  and  linear 
observations.  The  main  contributions  here  are  the  results  for  nonlinear  obser¬ 
vations  and  controlled  dynamics.  We  remark  that  these  designs  do  not  involve 
coordinate  transformations,  canonical  forms,  local  linearization,  etc,  and  seem 
robust  when  compared  with  other  designs.  However,  the  designs  do  involve 
solving  Riccati  equations  and  computing  certain  matrices  and  constants. 

In  Section  2  we  give  the  observer  design  and  state  the  main  convergence 
results.  The  design  involves  Riccati  differential  equations  with  time-varying  co¬ 
efficients,  and  in  Section  3  we  obtain  bounds  on  the  solutions  of  these  equations 
under  certain  detectability  and  rank  conditions.  These  bounds  are  used  in  Sec¬ 
tion  4  to  prove  the  convergence  result.  Finally  in  Section  5  we  give  several 
examples. 


2  Observer  Design 

We  assume  that  /,  h  are  smooth  with  bounded  derivatives  of  orders  1  and  2.  Let 
N  G  L(lRn,  IRn ),  R  G  L(MP,  1RP)  and  assume  rank  N  —  n  and  R  >  0.  Assume 


1 


1 u(t)  is  continuous. 

Write  A(x,u)  —  Df(x,u),  H(x)  =  R~xDf i(x),  where  D  denotes  gradient  in 
the  x  variable.  Set 


||A||  =  sup{||A(x,u)||  :  x  e  lRn,  |u<|  <  1} 

and  similarly  define  ||iif||,  and  so  on. 

Consider  the  coupled  system 


rh(t) 

=  f(m(t),u{t))  +  Q(t)P(m(t))'P-1(y(t)  -  h(m(t)) 

(4) 

m(  0) 

=  m0 

m 

=  A(m(t),u(t))Q(t)  +  Q(t)A(m(t),u(t))' 

-Q(t)H{m{t))'H(m{t))Q{t)  +NN' 

(5) 

<?(o) 

=  Q0>  0. 

This  is  our  observer  for  (1).  It  is  essentially  a  modification  of  the  deterministic 
or  minimum  energy  estimator,  as  discussed  in  Baras,  Bensoussan  and  James 
[l].  Note  in  particular  that  the  Riccati  differential  equation  (5)  depends  on  the 
control.  This  is  not  necessary  when  f(x,u)  =  f(x)  -f-  Bu  :  set  A(x)  =  Df(x). 

Refering  to  Section  3,  we  will  assume  that  the  pair  {H(x),  A(x,  u))  is  uni¬ 
formly  detectable.  Since  N  has  rank  n  and  ||.A||  <  oo,  the  pair  (A(x,u),N)  is 
uniformly  stabilisable  (refer  to  Section  3),  and  NN'  >  r0I  for  some  tq  >  0.  Let 
P0  =  Q-1,  P(t)  =  and  let  p,q  be  the  bounds  for  ||P(t)|j,  ||Q(t)||  (given 

in  Section  3). 

Regard  A0,N,R  as  design  parameters.  Define  p  =  p(Q0,N,R)  by 

p  =  ~ rT7~njT7  (s/rWfW  +  V«ll-R"‘ir  l|Ofc||  (6) 

92IKo'  II 

Our  main  theorem  is  the  following  convergence  result,  similar  to  Theorem  8  in 

[11- 


Theorem  2.1  Assume  there  exists  Qo,N,R  such  that 

1*0  -  mol  <  p(Qo,  N,  R)  (7) 

Then  the  system  (4),  (5)  is  an  observer  for  the  nonlinear  control  system  (1) 
provided  that  {H(x),  A{x,  u))  is  uniformly  detectable  and  the  above  assumptions 
hold.  That  is,  there  exists  contants  K  >  0,7  >  0  such  that 

|x(t)  -  m(t)|  <  K |x0  -  m0\e~'1*  (8) 


for  all  t  >  0. 
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Remark  There  is  a  trade-off  between  the  decay  rate  7  =  7  (Q0,  N,  R)  and  the 
radius  of  convergence  p.  The  designer  can  optimize  his  choice  of  7 ,p  by  varying 
the  design  parameters.  /// 

By  using  different  estimates  for  the  nonlinearities,  we  obtain  an  observer  for 
(1)  without  any  contraints  on  the  initial  conditions  x0,  m0  for  a  class  of  systems 
Included  in  this  class  are  systems  for  which  A(x,u)  is  uniformly  negative  definite 
(see  the  example  in  Section  5.2). 

Define  6  =  S(Qo,N,  R)  by 

«  =  ^-4p||£l/j!-4||fl-1||!||Z)A||I.  (9) 

If  D2f  or  D2h  is  zero,  we  omit  the  corresponding  term  from  (9). 

Corollary  2.1  Assume  there  exist  Q0,N,R  such  that 

0<6{Q0,N,R).  (10) 

Then  the  system  (4),  (5)  is  an  observer  for  the  control  system  (1)  provided  that 
(H(x),A(x,u))  is  uniformly  detectable  and  the  above  assumptions  hold.  That  is, 
there  exists  contants  K  >  0,7  >  0  such  that 

\x(t)  -  m(t)\  <  K\x0  -  m0\e~',t  (11) 

for  all  t  >  0  and  all  x0,  m0  G  JRn. 

Remark  Our  design  can  readily  be  extended  to  time  varying  systems.  /// 

3  Riccati  Equations 

Write  X  =  2Rn  X  [—1,  l]m  and  £  =  (x,  u)  G  X.  If  t  >->  =  (xt,  ut )  is  a  continuous 

curve,  we  write  At  =  .A(£t)  =  i4(xt,ut),  etc. 

Consider  the  Riccati  differential  equations 

Qt  =  AtQt  +  QtA't  -  QtH'tHtQt  +  NN'  (12) 

Pt  =  —PtAt  -  A'tPt  -  PtNN'Pt  +  H'tHt  (13) 

Qo  =  Po'>0 

Existence  and  uniqueness  for  (12),  (13)  are  standard,  and  note  that  Pt  =  Qf1. 

3.1  Uniform  Detectability  and  Stabilisability 

In  this  section  we  present  sufficient  conditions  that  ensure  boundedness  of  the 
solutions  of  the  Riccati  equations  (12),  (13).  The  bound  for  ||  Qt  ||  requires  a 
detectability  condition  which  we  now  define. 
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Definition  The  pair  of  matrices  (#(£),  -4(f))  is  uniformly  detectable  if  there 
exist  a  constant  ao  >  0  and  a  bounded  continuous  matrix  valued  function  A(£) 
such  that 

y'iMt)  +  A(£)tf(£))  V  <  -ao  |  r)  I2  (14) 

for  all  r)  G  lRn,  (el, 

This  condition  is  similar  to  the  well  known  detectability  condition  for  linear 
time-invariant  systems.  The  pair  of  matrices  (C,  A)  is  detectable  if  there  exists 
a  matrix  A  such  that  the  eigenvalues  of  A  +  AC  have  strictly  negative  real  parts; 
uniform  detectability  implies  detectability,  but  not  conversely.  A  disadvantage 
of  this  condition  is  that  it  is  in  general  difficult  to  check,  and  A(x,  u )  may  be 
hard  to  compute.  No  simple  rank-type  condition  exists  to  date.  In  the  case  that 
H(x)  is  uniformly  of  full  rank ,  that  is, 

H{x)'H{x)  >  s0I,  (15) 

for  some  so  >  0,  it  is  possible  to  bound  ||  Qt  ||  without  using  (14). 

To  obtain  a  uniform  bound  for  ||  Pt  ||,  we  assume  that  rankiV  =  n  and  use  the 
following  uniform  stabilisation  result,  based  on  Kalman  [2].  Let  $p(t,to)  denote 
the  fundamental  transition  matrix  corresponding  to  a  time  varying  matrix  Ft. 
Recall  NN'  >  r0I. 


Lemma  3.1  Assume  rankN  —  n.  Consider  the  control  system 

zt  =  —Atzt  —  Nut,  z(0)  =  z,  (16) 

where  At  =  A(£*)  for  some  curve  t  i— >  £*.  Then  there  exists  a  feedback  control 
u°t  =  Ttzt  such  that 

GXP  (-2^(<-*°))  ’  ^ 
for  t  >  to  >  0,  where  At  —  —At  —  NTt  and  for  any  a  >  0, 

=  ae-^>A>  (l  +  ||  N  ||!)'' , 


AW  =  • 

II  r,  ||<||  n  II  AW  =11  r  II . 
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Proof  Consider  the  optimal  control  problem  of  minimising  the  cost 

Ju{t 0,2)  =  f  ||  zt  ||2  +  ||  ut  ||2  dt 

Jto 

over  controls  u  for  the  system  (16).  Define  the  controllability  grammian 

fto+a 

C (to,  +  <7)  =  /  $-A(t0,t)NN' $'_A (t0 ,t)dt. 

Jto 

Note  that  ||  $-A(t,t0)  ||2<  e2UAH,  and 

r0c re“2||A|1/  <  C(t0,t0  +  a)  <||  N  ||2  <7e2||x|lJ 

so  that  the  system  (16)  is  uniformly  completely  controllable  [2], 

The  value  function  is  given  by 

V{t0,z)  =  ^ z'Ztoz , 
where  Zt  solves  the  Riccati  equation 

Zt  =  AtZt  +  ZtA[  +  ZtNN'Zt  -  I. 

The  optimal  control  is 

u°  =  -N'ZtZt. 

Now 

/•<o+c 

V(t0,z)  >  J*  II  Zt  ||2  dt 

>  f30(o)  \z\2  . 


Also,  using  the  control 

u]  =  N'$-A{to,t)C{t0,t0  +  a)~lz. 


we  obtain 

V(t0,z)  <  Pi{cr)  |  -3  |2  • 

Finally,  note  that  V[t,z)  is  a  Lyapunov  function  for 


z°t  =  -(At  +  iVrt)<; 


whence  (17). 
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3.2  Bounds 

Theorem  3.1  Assume  that  £  A(£),H(£)  are  continuous  and  bounded, 

(ff(£),  .<4(£))  is  uniformly  detectable,  and  rankN  =  n.  Then  we  have 


WQtW  < 

1 1  1 1  — 


||Qo||  + 

'A> 


+ 


Po  + 


2  oto  , 

l#ll2  +  liri 


20o 


=  q  <  oo, 


=  p  <  oo. 


(18) 

(19) 


These  bounds  are  independent  of  T  >  0. 


Note  The  bound  q  depends  on  the  choice  of  A,  while  p  depends  on  a.  To 
obtain  the  best  bound,  one  can  optimise  over  these  parameters.  For  linear 
time-invariant  detectable  systems,  one  can  also  obtain  a  bound  for  ||  Qt  ||.  /// 


Proof  We  modify  an  argument  in  [1].  To  prove  (18),  consider  the  following 
linear  optimal  control  problem  with  time- varying  coefficients: 

-  Vt  =  A'trit  +  H'tvt,  t)T  =  h,  (20) 

where  h  G  IRn  is  given  and  v  is  the  control.  The  cost  functional  is 

Ji{v,T)  =  ri'oQom  +  f  ( v'tvt  +  r]'tNN'r]t)dt.  (21) 

Define  a  value  function 


V1{h,T)  =  mi{J1{v,T):tlT  =  h} 

The  Hamilton- Jacob i-Bellman  (HJB)  equation  is 

+  max]  DM- A',  -  H[v)  -  v2  -  V'NN'„\  =  0 
Let  Qt  be  the  solution  of  (12).  Then 

v{v,t)  =  v'QtV 

is  the  unique  (viscosity)  solution  of  (13)  with  V(rj, 0)  =  p'QoV- 
Consider  the  (suboptimal)  feedback  control  law 

v(t)  =  A'trjt- 


Then  by  (20), 


-  Vt  =  {A't  +  H'tA’t)rit ,  rjT  -  h. 


(22) 
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Then  we  have 


V(rj,T)  =  h'Qrh  <  rjoQorjo  +  f  r)'t(NN'  +  AtA|)r?t 

Jo 

Now  using  (22), 

\rio\2-2fTv,t(A,t  +  H,tA't)Vtdt=\h\2 

Jo 

Hence  using  uniform  detectability  (14),  |?7o | 2  <  |A|2  and 

jT 

Jo  2a0 

Combining  this  with  (23)  we  obtain 


h'QTh  <  h'  HQoll  + 


IlN'H2  +  ||  A" 2 


which  proves  (18). 

Similarly,  to  prove  (19),  consider  the  optimal  control  problem 


with  cost 


Af  —  At  Xt  +  Nvt,  A t  —  h 


J2{v,T)  =  A'  P0  A  +  fT(v't  vt  +  A '  H[  Ht  A t)dt. 
Jo 


Define  the  value  function 


V2{h,T)  =  inf{J2(v,T)  :  Ar  =  h}. 


Then  the  HJB  equation 


— —  V2  +  ma x[D\  V2(^4tA  +  Nv)  —  it  —  A'  H[  Ht  A]  =  0, 
oT  v 

with  V2(A,0)  =  A'  Pq  A  and  solution  V2(X,t)  =  X'  Pt  A,  where  Pt  is  the  solution  of 
(13).  By  Lemma  3.1,  set 

v{t)  =  Tt  Xt, 

then  by  (24) 


Thus 


At  —  ( At  -t-  NTt)i  Xt  —  h. 


h! Pjh  <  Aq  Pq  Ao  +  [  X't  (rj  Tt  +  H't  Ht)Xt  dt 
Jo 


At  =  $A(-t,-T)A, 
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and  hence  [  A0  |2<  ^  |  h  |2  and 


This  together  with  (26)  yields  (19). 


I 


Corollary  3.1  Assume  that  £  h->  A(£),  H(£)  are  bounded  and  continuous, 
and  that  H(£)  is  uniformly  of  full  rank.  Then 


N  ||2  +  ||  A 

2o:o 


=  q  <  oo, 


(27) 


for  all  T  >  0,  where  for  any  t  >  0, 

oo(t)  =  re-2Tl!AH  (l  +  rV'M  ||  N  ||2)_1 , 

N  ||2\ 


ai(r)  =  re4rll'AH  ( 1  + 


r0r 


At  ||<||  H  ||  ckx(t-)  =||  T  ||  . 


Proof  Consider  the  control  system 

it  =  A'tzt  +  H'tut,  z{ 0)  =  (28) 

where  At  =  A(£t)  for  some  curve  t  Define  the  grammian 

fto+T 

0(to,to  +  t)  =  /  $Ai(to,t)H'tHt$Ai(ta,t)dt. 

Jto 

Now  ||  $a'(Mo)  ||2<  e2liAH,  and 

so re~2^I  <  0{t0,t0  +  r)  <||  H  ||2  re2^I 

so  that  the  system  (28)  is  uniformly  completely  controllable  [2].  Now  proceed 
as  in  the  proof  of  Lemma  3.1. 

I 
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4  Asymptotic  Convergence 

Using  the  bounds  (18),  (19)  we  prove  Theorem  2.1,  and  Corollary  2.1. 


Proof  of  Theorem  2.1  The  error  e[t)  =  x(t)  —  m(t )  satisfies 

e(t )  =  f{x(t),u(t))  -  f{m(t),u(t))  -  Q(t)if(mt)'i2_1(y(t)  -  h(m(t))) 

=  [A(m(t),u(t))-Q(t)H(m(t))'H(m(t))}e(t) 

+  [f(x(t),u(t))  -  f(m(t),u(t))  -  Df(m(t),u(t))e(t )] 

-Q{t)H{m{t))' R~l[h(x(t))  -  h(m(t))  -  Dh(m(t))e(t)} 

Therefore  using  the  Riccati  equation  (13)  for  P(t ), 

i-dt)' P(t)e(t)  =  -e(tyP(t)NN'P(t)e(t)-C(tyH(m(t)yH(m(t))e(t) 

at 

+2 e(t)'P(t)  [  [  rD2 f(m(t)  +  rse(t),u(t))e(t)2  drds 
Jo  Jo 

—2e(t)'H(m(t))'R~1[  [  rD2h(m(t)  +  rse{t))e{t)2  drds 
Jo  Jo 

<  C(t)'(-ro/i  +  \P(t)h(t)\{^\\D*f\\  (29) 

+V/«||B-||*  ll™ll  ||C*A||])  e(t) 

Let  C  =  (v/p||L>2/||  +  VqWR^W2  \\Dh\\  \\D2h\\).  By  hypothesis  (7)  we  have 

+  |P0*«o|C  <  0. 

q2 

Since  P(t)%e(t)  is  continuous,  there  is  an  interval  [0,t0)  such  that 

-~  +  |P(t)5e(0|C<°  on  [0,<0). 

But  then  (29)  implies 

^-\P{t)*e{t)\2  <  0  on  [(Mo), 
at 

and  thus  l 

|P(0*«WI  <  l^o’eol 

for  t  e  [0,  t0)-  By  continuity  this  inequality  holds  for  t  €  [0,t0].  Hence  we  can 
proceed  from  to  on.  Thus  there  exists  6  >  0  such  that 

|P(0M<)l<5(p-«) 
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This  system  is  controllable  and  observable.  However,  the  pair  of  matrices 
{Dh{x),  A)  is  not  observable  for  xi  =  k*,  where  k  is  an  odd  integer.  The  system 
has  eigenvalues  -1,-2  and  A  is  symmetric,  hence  ( Dh(x),A )  is  automatically 
uniformly  detectable,  with  a0  =  1,  A(x)  =  0.  Let  R  =  rl,  N  =  y/fol,  Qo  —  7 2I. 
Here,  H{x)  =  ^(cosxi,0).  Now 

6  =  ro(72  +  ro/2)~2  -  4r-2. 

Set  r  =  3,  r0  =  0.2,  7  =  0.1.  Then  6  -  7.82. 

The  observer  for  (32)  is 

m{t)  =  Am(t)  +  Bu(t)  +  H(m(t))'  (y(t)  —  sinmi(t)),  (33) 

Q(t)  =  AQ(t)  +  Q(t)A‘  -  Q{t)H(m(t))'H{m{t))Q{t)  +  0.27. 

By  Corollary  2.1,  m(t)  converges  exponentially  to  x(t)  for  all  xo,m0  £  JRn. 
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5  Examples 

We  now  give  some  simple  examples  to  illustrate  the  applicability  of  the  above 
observer  design. 

5.1  Bilinear  Dynamics,  Linear  Observation 

Consider  the  general  bilinear  system 

m  \ 

A  +  UiBi  J  x,  x(0)  =  x0,  (30) 

«=i  / 

y  =  Cx. 

We  assume  |u,-|  <  1,  p  =  1,  and  here  (  =  u£  [—1,  l]m  =  X.  Write 

m 

A(u)  =  A  +  Y^UiBi. 

t=i 

Define,  for  r  >  0,  the  observability  grammian 

ft0  +  T 

0{t0,to  +  r)  =  /  $A'{to,t)C'C&A,{to,t)dt, 

Jto 

where  At  =  ^4(u(t)).  Assume  that  (30)  is  uniformly  observable  in  the  sense  that 
there  exists  r  >  0  such  that  for  all  to  >  0 

lo {t)I  <  0{t0,t0  +  r)  <  7i (r)J 

for  constants  '7o(^‘)»  7i(r)  >  0>  independent  of  the  control.  The  we  can  bound 
||  Qt  ||  as  in  Corollary  3.1. 

Then  the  following  system  is  an  observer  for  (30),  with  no  contraints  on  the 
initial  conditions. 

ih(t)  =  A(u(t))m(t)  +  Q{t)C'(y(t)  -  Cm{t)),  m{ 0)  =  m0,  (31) 

Q(t)  =  A{u{t))Q(t)  +  Q{t)A{m{t))' -Q{t)C'CQ{t)  +  I,  Q0  =  I. 

For  simplicity  we  have  taken  Q0,N,R  to  be  identity  matrices.  To  improve  the 
decay  rate  '7,  one  could  try  other  values  for  Qo,  N,  R. 

Compare  this  design  with  the  design  for  linear  time-varying  systems  in 
Willems  and  Mitter  [5],  and  O’  Reilly  [4]. 

5.2  Linear  Dynamics,  Nonlinear  Observations 

Consider  the  system 


y  -  sinxi. 
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This  system  is  controllable  and  observable.  However,  the  pair  of  matrices 
(Dh(x),  A)  is  not  observable  for  27  =  where  k  is  an  odd  integer.  The  system 
has  eigenvalues  —  1,—  2  and  A  is  symmetric,  hence  ( Dh(x),A )  is  automatically 
uniformly  detectable,  with  ao  =  1,  A(x)  =  0.  Let  R  =  rl,  N  =  y/r^I,  Q0  =  72/. 
Here,  H(x )  =  ^(cosxi,0).  Now 

6  =  ro ('T'2  +  r0/ 2)-2  -  4 r-2. 

Set  r  =  3,  ro  =  0.2,  7  =  0.1.  Then  6  =  7.82. 

The  observer  for  (32)  is 

m(t )  =  Am(t)  +  Bu(t)  +  H(m(t))'  (y(t)  —  sinm^t)),  (33) 

O 

Q(t)  =  AQ{t)  +  Q{t)A'  -  Q{t)H{m(t))'H{m(t))Q{t)  +0.2/. 

By  Corollary  2.1,  m{t)  converges  exponentially  to  x(t)  for  all  xo,mo  £  lRn. 
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